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THE LARGEST COMPONENT IN A SUBCRITICAL RANDOM 
GRAPH WITH A POWER LAW DEGREE DISTRIBUTION 

By Svante Janson 

Uppsala University 

It is shown that in a subcritical random graph with given ver- 
tex degrees satisfying a power law degree distribution with exponent 
7 > 3, the largest component is of order n 1 '" -1 '. More precisely, the 
order of the largest component is approximatively given by a simple 
constant times the largest vertex degree. These results are extended 
to several other random graph models with power law degree distri- 
butions. This proves a conjecture by Durrett. 

1. Introduction. Random graphs where the asymptotic distribution of 
the vertex degrees has a power law tail have been the focus of much interest 
during the last decade or so, including both theoretical studies and various 
applications; see, for example, the books by Durrett [7] and (from a physical 
point of view) Dorogovtsev and Mendes [6]. 

To be precise, we will in this paper consider uniformly distributed random 
graphs with a given degree sequence, defined as follows. (See Section 4 for 
extensions to some other random graph models.) Let n £ N and let (di)™ be 
a sequence of nonnegative integers. We let G(n, (di) n ) be a random graph 
with degree sequence (di) n , uniformly chosen among all possibilities (tacitly 
assuming that there is any such graph at all; in particular, di has to be 
even) . 

We consider asymptotics as n — > oo, and thus we assume throughout the 
paper that we are given a sequence (di)™ for each n G N (or at least for some 
sequence n — > oo); di may depend on n but for notational simplicity we will 
not show the dependency on n explicitly. (Similarly, we often omit the index 
n on other parameters and variables below.) 

We let the random variable D = D n be the degree of a randomly chosen 
vertex in G(n, (di)™). Thus ¥(D = k) = n^/n, where := \{i : di = k}\ is the 
number of vertices of degree k in G(n, (di)™). 
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We say that G(n, (c£i)i ) has a power law degree distribution with exponent 
7 if P(-D n = A;) — ► as n — > oo for every A; > and some probability dis- 
tribution (pfc)o°> an d this asymptotic degree distribution satisfies pk ~ cA~ 7 
as k — > oo for some c > 0. [This is thus a doubly asymptotic notion, letting 
first n — > oo and then k — > oo. Actually, the theorems below will not assume 
this; we will use the weaker inequality (1.3) instead, but, on the other hand, 
we have to impose uniformity in n. Nevertheless, the reader ought to think 
mainly of the case of a power law degree distribution.] Note that necessarily 
7 > 1, since otherwise (p^) would not be summable. 

The existence of a giant component (i.e., a component of order n) in 
G(n,(di)i) has been studied by Molloy and Reed [15, 16]; see also Janson 
and Luczak [11]. To state their result we let C\ be the largest component of 
G(n, (di)i) and |Ci| its order. We further introduce the two parameters 

^ oo 1 n 

(1.1) fi = fi n :=ED n = - y2kn k = - Y"^, 

fc=0 i=l 
Y oo 1 n 

(1.2) v = v n := — V Hk - l)n k = — V - 1); 

p is thus the average degree and v is the mean number of offspring in the 
usual branching process approximation of the local structure starting at 
any fixed vertex; see, for example, [7]. Loosely speaking, the condition by 
Molloy and Reed [15] for existence of a giant component is v > 1. More 
precisely, if we assume as above that n k jn — > p k , k>0, then, under suit- 
able conditions ensuring uniform summability, fj, n — > /jLqo := Ylk^o^Pk and 
v n u oo '■= A^oo 1 J2k=o k(k — l)pk, and (under weak additional technical as- 
sumptions; see [11, 15]), if isoc > 1, then there is a constant p > such that 
\Ci\ = pn + o p (n), while if < 1, then = o p (n). (We use o p and O p 
in the standard way; see, e.g., Janson, Luczak and Rucinski [12]. For ex- 
ample, \C\\ = o p (n) means that P(|Ci| > en) — > as n — > oo for every e > 0; 
equivalently, |Ci|/n — 0.) 

Returning to power law degree distributions, we see that if 1 < 7 < 2, 
then poo = 00, and if 2 < 7 < 3, then p^ < 00 and = 00; in both cases we 
expect a giant component. (We will not study these cases further.) In the 
sequel, we will assume 7 > 3, and thus < 00. (Note that in this range, 
the condition > 1 is determined mainly by the values of p\~ for small k, 
such as k = 1, and is essentially independent of the behavior of p k for large 
k and in particular of the value of the exponent 7. Relations between 7 and 
the existence of a giant component that have been reported in the literature 
are artifacts due to the assumption of specific forms of p k also for small A;.) 

We will in this paper consider the subcritical case z^, < 1, when |Ci| is of 
smaller order than n, and try to estimate its order more precisely. Durrett 
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[7] observed that while it is well known that for a subcritical Erdos-Renyi 
random graph \C\\ = O p (logn) (see, e.g., [1], Chapter V.2), this fails for 
graphs with power law degree distributions. In fact, typically the largest 
vertex degree is of order n 1 /^ 7-1 ), and obviously the largest component is 
at least as big. Durrett ([7], Conjecture 3.3.1), conjectured (for a slightly 
different random graph model; see Section 4.1) that this is the correct order 
of the largest component. In the present paper, we prove this conjecture. 
A related result has recently been shown by Pittel [20], who showed that 
under the stronger condition W(D n = k) = 0(k~" / ), which implies that the 
maximum degree is Oin 1 /' 1 ), the conclusion \C\\ = O p (n 1 ^ \ogn) holds. 

All unspecified limits below are as n — > oo. We say that an event holds whp 
(with high probability), if it holds with probability tending to 1 as n — > oo. We 
let C\,C%... denote unspecified constants, sometimes depending on other 
parameters but never on n. 

Theorem 1.1. Consider the random graph G(n, (dj)™), with (dj)™ = 

(d[ n ^)i as above, and let C\ be its largest component. Suppose, using the 
notation above, that [i n — > [i^ > and u n — > < 1. Suppose further that 

(1.3) F(D n >k) = 0(k 1 -^), 

uniformly in n and k > 1, for some 7 > 3. Then there exists a constant A 
such that \C\\ < ^n 1 ^ 7-1 ^ whp. 

Remark 1.2. It is not necessary to assume that a limit distribution 
(pk) exists, and the assumptions fj, n — > /-too > and v n — > < 1 may be 
replaced by liminf [i n > and limsupf n < 1. The constant A depends only 
on liminf /x n , limsupi^ n and the constant C\ implicit in (1.3). 

Let A = A n := maxj d{, the maximum degree of G(n, (di)" ). Note that the 
assumption (1.3) implies 

n^ 1 < F(D > A) < C1A 1 - 7 

and thus 

(1.4) /\<C 2 n l l^'- l \ 
We can sharpen Theorem 1.1 as follows. 

Theorem 1.3. Under the assumptions of Theorem 1.1, 

(1.5) \C 1 \ = T ^ + o p (n 1 /(^)). 
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As said above, typically we expect A = A n to be of the order n 1 / , and 
then Theorem 1.3 shows that \C\\ is of this order also. The factor 1/(1 — v) 
in (1.5) is exactly what the standard branching process approximation of 
the local structure suggests: If we start at a vertex v* with degree A, it has 
A neighbors; each of them has on the average about v further neighbors, so 
there are about vA vertices of distance 2 from v*. Continuing, we expect 
to find about u^~ 1 A vertices of distance j from v*, and thus in total about 
A/(l — v) vertices in the component containing v*; see further Section 3. 

We extend Theorems 1.1 and 1.3 to some other random graph models in 
Section 4. 

We have not attempted to find any sharper estimate of the error term in 
(1.5), and leave this as an open problem for the reader. This is especially 
interesting when A = o{n l /^~ 1 ^). For example, as said above, Pittel [20] 
studies a special case of the situation above with A = 0(n 1 / 7 ), and shows 
that under his assumptions \C\ \ = O p (n 1 / 7 log n). 

Remark 1.4. The proof in Section 3 is easily modified to show that if 
Cj is the jth largest component and A^ is the jth largest vertex degree, 
then 

AU) ',V<7-lh 



\Cj \ = ho p (n 

1 — v 



for every fixed j ' > 1. Moreover, it follows that, for every fixed J, whp the J 
vertices with largest degrees belong to different components, each of them 
being of order roughly (1 — v)~ l times the degree of that vertex. If these 
vertex degrees are sufficiently well separated, the largest component thus 
contains the vertex with largest degree, and so on (for at least J — 1 com- 
ponents), but, of course, this can fail if two of the degrees are very close to 
each other. 



Remark 1.5. As is customary, we have concentrated on the number of 
vertices in the components in these results. The proofs show that the same 
results hold also if we consider the number of edges in the components. 
Consequently, the difference between the numbers of edges and vertices in 
the largest component is o p {n l /^~ 1 ^). 

2. Proof of Theorem 1.1. It will be convenient to work with multigraphs, 
that is, to allow multiple edges and loops. More precisely, we shall use the fol- 
lowing standard type of random multigraph: Let n € N and let (di)™ be a se- 
quence of nonnegative integers such that Yh=i d% is even. We let G*(n, (di)™) 
be the random multigraph with given degree sequence (dj)", defined by the 
configuration model (see, e.g., Bollobas [1]): take a set of di half-edges for 
each vertex i, and combine the half-edges into pairs by a uniformly random 
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matching of the set of all half-edges (this pairing is called a configuration); 
each pair of half-edges is then joined to form an edge of G*(n, (dj)i)- 

We obtain G(n, (di)™) by conditioning the multigraph G*(n, (di)i) on be- 
ing a (simple) graph. It is well known that the conditions in Theorem 1.1 
imply that 

(2.1) liminf F(G*(n, (d*)?) is simple) > 0, 

n — »oo 

and thus it suffices to prove the result for the random multigraph G*(n,(di)i). 
(In fact, by Janson [9], a necessary and sufficient condition for (2.1) is 
EL>2 = 0(ED n ), or equivalently u n = 0(l).) 

We thus consider the random multigraph G* (n, (dj)™), and prove Theo- 
rem 1.1 for this case. Fix a vertex v and explore the component C(v) contain- 
ing v by the standard exploration process. This process starts by declaring v 
used and all half-edges at v active. Then, proceed as follows for z = 1,2,..., 
as long as there is any active half-edge: Take an active half-edge, say X\. (In 
this proof it does not matter which one we choose; we may use any deter- 
ministic or random rule. Later it will be convenient to use the breadth-first 
version where the active half-edges are processed in order of appearance.) 
Find its partner ?/, in the configuration [thus Xiyi is an edge in G*(n, (di)i )], 
and let vi be the other endpoint of the edge Xiyi, that is, the endpoint of yi. 
If the vertex v% is not already used, declare the remaining d(vi) — 1 half-edges 
at V{ active. Finally, declare Xi, yi and Vi used. Repeat. 

Let Si be the number of active half-edges after i steps. Thus So = d(v) 
and Si = Si— i + £j — 1, i > 1, where £j is the number of new half-edges found 
at step i. We have £j = d(vi) — 1 if Vi is not already used; otherwise & = 0. 

The process stops at r := min{i : Si = 0}; we then have found r edges, so 
the component C(v) has r edges and at most r + 1 vertices. For definiteness, 
we let £j = for i > t. 

In particular, for any integer M > 0, if \C(v)\ > M, then r > M so the 
process lives at least until M and < 5a/ = d(v) + J2i=i(£i ~ 1) an d thus 

AI 

(2.2) d(v) + Y,ti>M. 

i=l 

We reveal the random configuration only as we need it. Equivalently, we 
may construct the random configuration during the exploration, by choos- 
ing each partner yi uniformly among all half-edges except Xj and the ones 
already paired. The numbers £j then become random variables. The random 
variables are dependent, but we can approximate them by independent 
random variables. More precisely, we bound them as follows. 

Suppose that i < y/n, say. (Our components will be much smaller.) If 
£i = k > 1, then d{vi) = k + 1, so there are at most n^+i = n¥(D = k + 1) 
possible choices for Vi (excluding the used ones) and thus at most (k + 
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l)nk+i = n{k + l)P(Z) = k + 1) choices of the half-edge yi. When choosing 
Hi, we have revealed the pairings of 2{i — 1) half-edges and chosen Xj, but 
j/i is uniformly distributed over all ^?=i4? — 2(i — 1) — 1 = n/i — 0(n 1//2 ) 
remaining half-edges. Hence, given any history of the exploration process up 
to step i, 

n(k + 1)P(Z) = k + 1) 
nn — (Jin 1 '*) 

(2.3) 

= (t + l)P( = t + l) 7 
A* 

Fix e > with 4e < 1 — z^, and let i/ = i/ + e. We consider only n that 
are so large that v = u n < 1 — 3e, and thus v 1 < 1 — 2e. Let X be an integer- 
valued random variable with the distribution given by 

(2.4) nx> X )=mJiXY, (k+mD = k+1) \ *>0. 

If n is large enough (not depending on i or fc), the factor 1 + 0{n~ 1 / 2 ) 
in (2.3) is less than v' /v, and thus by (2.3) and (2.4), given any previous 
history, 

P(& > x) < F(X > x) for every x > 0. 

In other words, £ is stochastically dominated by X. In particular, this holds 
given £i, . . . , £j_i, and it follows that the sequence £i, £2, • • • rnay be coupled 
with a sequence of independent random variables Xi with Xj, = X, such that 

(2.5) &<X» l<i<Vn. 
Furthermore, trivially, 

(2.6) & < A, i > 1. 

The remainder of the proof will only use the properties (2.5) and (2.6) of 
(&)• 

We note first that, by (2.4) and the assumptions of the theorem, 
EX _ f P(X > ra) < ± £ f ^±M^i±i> 

m=l ^ m=l k=m ^ 

v' ~ , (fc + 1)P(D = fc + 1) 1/ , 

17 tl V 
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and, for all x > 0, with m := \x \ . 



nx> x) <"-± <*±w=* + 

k=m 



/' 



(2.8 



<C 3 2 kF(D = k) 

k=m+l 

= C 3 (mF(D>m)+ £ P(Z>>j)J 

V j=m+l ) 

oo 

j=m+l 
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Let M = An 1 '^ ^ and M\ = n 1 '^ ^ 5 for some positive constants A 
(large) and 5 [small, and in particular less than 1/(7 — 1)] to be chosen 
later, and let := X;lpQ < Mi]. Thus, by (2.7), EY* < EX < v' . 

If \C(v)\ > M, then (2.2) holds and thus, using (2.6), 

M M M 

M < d(v) +Y t Zi<& + Y l Zil[X i < Mi] + ]T > Mi] 

i=l i=l i=l 

M I M \ 

(2.9) <Y,Yt + A l + ^l[Xi>Mi] 

i=i V i=i / 

M / M \ 

<^ , M + ^(y J -Ey J ) + A i + ^i[Xi >Mi] 

i=l V i=l / 

and thus, recalling 1/ < 1 — 2s, 



>(|C(i;)| > M) < P ]T(F - EVj) > eM 



(2.10) 



N.l=l 



+ p(^l[Z i >Mi]> £ --lj. 



Note that both sums on the right-hand side are sums of i.i.d. random vari- 
ables, so we can, and shall, use simple standard estimates for them. 

For the first sum in (2.10), fix a number r > 7 such that rS > 2. Then, 
by Rosenthal's inequality ([8], Theorem 3.9.1) (with constants depending on 

r), 



E 



M 



< C 6 M r/2 (E|yi - EYi| 2 ) r/2 + C 7 ME|Yi - EY"i| 
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(2.111 



< C 6 M r / 2 (EY?) r / 2 + C 8 MEY{. 
We estimate the moments of Y\ and obtain using (2.8) 



EYf 



2xP(Y 1 > x) dx 



(2.12) 



Mi 



2x¥(X > x) dx 
r Mi 



<i + c 5 



2x 3 " 7 dx < C 9 M 1 



(rather coarsely and because 7 > 3), and trivially 



(2.13) 



EY{ < M[. 



Consequently, by (2.11)-(2.13) and Markov's inequality, and recalling M\jM ■ 



M 



u=l 



i=l 



(2.14) 



< C W (EY?/M) r / 2 + CnM^'EYl 

< C 12 (M 1 /M) r/2 + Ci 3 M(Mi/M) r 

< C lA n- r& ' 2 + dsn 1 -*' 8 = oin- 1 ). 



For the second sum in (2.10), write Ii := l[Xi > M\] and note that for 
any positive integer L, since the Ii are i.i.d., 



M 
L 



It, = 1) 



(2.15) 



= (l) P(/i = 1)L 

<M l P(X>Mi) l . 
Moreover, by (2.8) again and the choice of M, 

MP(X > M x ) < c B A7iV(7-i)+(2-7)(i/(7-i)-«) = CAn^ 5 -^/^ . 

Choose <5 > such that Si := (7 - 3)/(7 - 1) - (7 - 2)5 > 0. Thus MF(X > 
M 1 )=0(n~ Sl ) and, by (2.15), 



(2.16) 



-LSi 
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Now choose L > l/<5i, and then A > C2(L + l)/e. Thus, recalling (1.4), 
(2-17) ^>^r>{L + l)/e. 

It follows from (2.16) that 

\ / M \ 

0{n~ L ^)=o{n~ 1 ). 



Consequently, (2.10) shows that, with this choice of A, P(|C(u)| > M) = 

o(n ). Since this holds for every vertex v, it follows that whp |C(u)| < M 

1/(7—1) 

for every vertex v, and thus \Ci \ < M = An 

3. Proof of Theorem 1.3. We continue to consider the random multi- 
graph G*(n,(di)i) and prove Theorem 1.3 for this case, which as above 
implies the result for G(n, (di)™) also. 

We begin with the upper bound, \C\\ < (1 — v)~ x A + o p (n 1// ( 7-1 )). Fix 
e > 0. Say that a vertex is fat if its degree is at least M2 := en 1 ^ 7-1 ) , and 
say that a component is bad if it contains at least two fat vertices. By (1.3), 
the number of fat vertices in G*(n, (di)™) is 

(3.1) n¥(D > M 2 ) < CmM^ 1 = Cie 1 " 7 = C 16 . 



Lemma 3.1. There is whp no bad component. 

Proof. Let, as in Section 2, M = An l /^~ l \ with A so large that whp 
\C\\ < M. Consider a given vertex v and the exploration process in Section 2. 
By (3.1), the number of fat vertices is at most C\q and thus the number of 
half-edges leading to a fat vertex is at most C16A. Consequently, at each 
step % of the exploration process, the probability of choosing a half-edge 
Ui leading to a fat vertex V{ is at most CiQA/(nu — (2i — 1)) and thus the 
probability of finding a fat vertex at some step i < M is at most, using (1.4), 

M ° mA = 0(n 1 /(7-D+i/(7-i)-i) = o(1) . 
npL — 2M 

Thus, for any given v, whp the component C(v) contains no fat vertex found 
before step M, except possibly v itself. On the other hand, we have chosen 
M such that whp |C(u)| < M; hence, for every given v, whp C(v) contains 
no fat vertex besides possibly v. 

We apply this starting with a fat vertex v. Since the number of fat vertices 
is bounded by (3.1), it follows that whp none of the components C(v) with v 
fat contains a second fat vertex, so none of them is bad. On the other hand, 
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every bad component has to contain a fat vertex v, and thus equal C(v) for 
some fat v. Consequently, whp there is no bad component at all. □ 

Let 5, M\, 5i and L be as in the proof of Theorem 1.1, but replace M by 

M' : = (1 - i/)- 1 (A + (L + ljen 1 ^ 7-1 )) < Cnn 1 ^^. 

Further, let M 2 := en 1 ^ 7 " 1 - 1 as above. Consider once more the exploration 
process started at a given vertex v. If \C(v)\ > M' and C(v) is good, then at 
most one of the degrees d(v),d{vi), . . . , c£(i>m') is greater than Mi, and that 
degree is at most A. Consequently, with L L := lpQ > Mi] as above for i > 1, 
and Io := 1 and vq := v, 

M' M' I M' \ M' 

d( v ) + J2 < E J *) < a + hr / 4 - 1 m 2 = a + m 2 

i=l i=0 \i=0 ) i=l 

Consequently, in this case (2.9) may be improved to 

M' M' 

M' < £ £il[Xi < Mi] + A + M 2 J2 T i 

1=1 1=1 

M' M' 

< v'M' + J2(Xi~ EY i) + A + M 2 E li 

i=l i=l 

and thus 

Af Af 

^(y, - EYj) + M 2 It > (1 - v')M' - A = (e + La) 

i=l i=l 



n l/(7-D. 



AT 



hence either 

(3.2) ^(Y; - Er 4 ) > > eC^M' 

i=l 

or, recalling M 2 := en 1 ^' y ~ 1 \ 

A/' 

(3.3) E J ^ L - 

i=i 

By (2.14) (replacing e by e/Cn) and (2.16), both events (3.2) and (3.3) have 
probabilities o(n _1 ). Consequently, 

P(|C(u)| > M' and C(v) is good) = o(n _1 ). 

Since this holds for every given vertex v, whp \C(v)\ < M' for every good 
component which together with Lemma 3.1 shows that whp 

\ Cl \ < M' = A + L + 1 en 1 ^- 1 ) < -A_ + C 18e nV(7-i) 
1 — 1/ — e 1 — 1/ — e l — i/ 
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(with Ci8 not depending on s). Since e can be chosen arbitrarily small, this 
shows that \C\\ < (1 — v)~ 1 A + o p (?i 1 ^ 7-1 ^), which is the upper bound we 
sought. 

To obtain a corresponding lower bound, let v* be the vertex with max- 
imum degree (choose any of them if there are several), and consider the 
component C(v*) containing v* . 

Let Nj be the number of vertices of distance j from v* . As said in the 
Introduction, the idea is that (by the usual branching process approxima- 
tion) Nj ~ v 3 ~ d(v*) = z/- ?_1 A, and summing over j > 1 yields ~ (1 — v)~ l A. 
In order to make this precise, we for simplicity truncate and consider only 
generations j < J and vertices of degree < K, where J and K are two fixed 
large integers. We thus let Njk be the number of vertices of degree k and 
distance j from v* . We find these by the exploration process in Section 2, 
now specifying the breadth-first version. Let 7ij, j > 0, be the set of active 
half-edges when we have processed all half-edges at vertices of distance less 
than j from v* ; thus Tij is the set of half-edges that may lead to a vertex of 
distance j + 1. We have \Hq\= d{v*) = A. 

Let, for k > 1, 

k¥(D = k) kn k 

Pk ■= = > 

fi /in 

the fraction of half-edges that are adjacent to a vertex of degree k (this is 
the size-biased distribution of D), and note that, by (1.1) and (1.2), 

oo oo 

5>;s=i, $>-iK=i/. 

k=l k=l 

Assume v > and let e > be so small that s < v. (The case v = is 
simpler and is left to the reader.) Let vk '■= J2i (k ~ ^-)p% an< ^ assume that K 
is so large that > e and Yli Pt > 1 — £■ Further assume that A > (logn) 2 
(otherwise the result is trivial), and assume below that n is large whenever 
needed. 

We may assume that the exploration process stops before it reaches 
^4 n 1 /(7~ 1 ) steps for some large A, either by Theorem 1.1 or because the lower 
bound otherwise is trivial. Then, at every step i there are — ©(n 1 ^ 7-1 ^) = 
n¥(D = k) — o(n) unused vertices of degree k and thus the probability to 
connect to one of them equals 

kn¥(D = k) - o(n) , . , 9 

^ ) TT ±- L =P% + o(l) > p* - e/K 2 , 

lin — o(n) 

given any history of the exploration. Hence, for every fixed j and k and 
given |7Yj_i | = hj-i, Nj^ dominates a sum of independent indicators with 
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the binomial distribution Bi(hj-i,p^ — e/K 2 ), and thus by the law of large 
numbers, whp 

(3.4) N jk >h^ 1 (p* k -2e/K 2 ), 

provided h 3 ~\ > logn, say. Moreover, the probability that a given vertex of 
degree k is chosen twice in the jth generation is 0(khj-i/n) 2 = ©(n 1 ^ 7 " 1 ) /n) 2 = 
o(n~ l ), so whp there are no repetitions and all these Nj k vertices yield k—1 
new active half-edges each. 

We consider only a finite number of (j,k), namely j < J and k < K, 
and thus whp the statements above hold for all these (j, k) simultaneously. 
Consequently, whp, for all j < J, 

K K 
\Hj\ > 5> - l)N jk > \H,-i\ l)(Pfc - 2e/K 2 ) > \Hj-i\(u K - e), 

k=l k=l 

and thus by induction (which also verifies /ij-i > logn) 

\H 5 \ > \H \(v K - e) j = (v K - e) j A, < j < J. 
By (3.4) again, it follows that whp, for 1 < j < J and 1 < k < K, 

N jk >{v K -ey- l &{pl-2e/K 2 ), 
and thus, summing over these j and k, 

\C(v*)\ > Y,{v K - sy- 1 J2(pt - 2e/K 2 ) A > 1 7 ( ^ " ^ (1 - 3e)A. 
3=1 k=i l-PK + e 

For any small rj > 0, we may choose e small and J and K large so that this 
yields, recalling (1.4), 

\Cx\ > \C(v*)\ > ( -1— - V )a> -A_ _ r,C 2 n l /^-V 
\1 — v J 1 — v 

whp, which yields the required lower bound and completes the proof. 

4. Extensions to other random graphs. Theorems 1.1 and 1.3 can eas- 
ily be extended to other random graph models. We will in this section 
study some models where we can obtain results as corollaries of the re- 
sults above for G(n,(dj)™). (Another approach would be to try to adapt 
the proofs above to other models, but we will not pursue this possibility 
here.) Indeed, SUppOSG tlleit Gfi is a random graph with n vertices (labeled 
l,...,n), such that G n conditioned on the degree sequence (dj(G n ))™ =1 is 
uniformly distributed over all graphs with this degree sequence. Equiva- 
lently, P(G n = H\) = P(G n = H2) for any two graphs H± and H2 with ver- 
tices 1, . . . ,n and the same degree sequence. (In statistical terminology, the 
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degree sequence is a sufficient statistic.) We then can apply the theorems in 
Section 1 to G n conditioned on the degree sequence. 
Define, in analogy with the notation above, 

(4.1) MG„) = -X>(G n ), 

1 n 

(4.2) u(G n ) = —^J2 d i( G n)(di(G n ) - 1), 

(4.3) A(G n ):=maxdi(G n ), 

i 

and note that these are random variables depending on the degree sequence 
of G n only. Further, let C\{G n ) be the largest component of G n (with any 
choice among them if there is a tie) . 

Theorem 4.1. Let 7 > 3. Suppose that, for n> 1, G n is a random graph 
with n vertices such that G n conditioned on the degree sequence is uniformly 
distributed over all graphs with this degree sequence. Suppose further that for 
every e > there exists 5 > such that 

(4.4) pfsupF" 1 !^:^^) > k}\/n>5'A <e, 

\k>l J 

(4.5) P(KG n ) < S) < e, 

(4.6) P(v(G„) >l — 5)<£. 

(In other words, sup fe>1 k 1 ~ 1 \{i : di(G n ) > k}\/n, l/fi(G n ) and 1/(1 — z^(G n ))_|_ 
are stochastically bounded.) Then 

A(67 ra ) ( ^ ^1/(7-1), 
1 - is(G n ) 

and, in particular, |Ci(G n )| = O p (n 1/<7 1} ). 

Proof. First, by conditioning on the degree sequence, we may assume 
that the events 

(4.8) supJfe' y - 1 |{i:d i (G n ) > k}\/n< S~\ 

k>l 

(4.9) (i{G n ) > S, 

(4.10) p(G n )<l-5 

hold (surely) for some 5 > 0. Indeed, if we have proved this case, we may 
condition on these events and see that the conclusion then holds; by (4.4)- 
(4.6), this leaves only an event of probability < 3e, which can be made 



( 4 - 7 ) |Ci(G n )| = - 777 ^ l q +o p [n ) 
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arbitrarily small, and the conclusion follows for G n without conditioning 
also. 

Second, the Cauchy-Schwarz inequality implies, using (4.10), 



and thus n(G n ) < 2. Hence, both fJ,(G n ) and u{G n ) are sequences of ran- 
dom variables that are uniformly bounded, and thus tight; thus there exist 
subsequences such that fJ,(G n ) and v{G n ) converge in distribution. We now 
use the standard subsubsequence trick: if every subsequence of (G n ) has 
a subsubsequence for which (4.7) holds, then (4.7) holds for the full se- 
quence. Consequently, by considering subsequences, we may assume that 

n) y Z^oo and v{G n 

) — ► z^oo for some random variables /Xoo 

and Uoo, 

with 5 < [loo < 2 and < < 1 — <5. 

Third, by the Skorohod coupling theorem ([13], Theorem 4.30), we may 
assume that all random graphs G n are defined on a common probability 
space and that the limits /x(G n ) — ► /ioo and v(G n 

) — > ^oo hold a.s. and not 
just in distribution. We now condition G n on its degree sequence (di(G n ))f =1 . 
By the reductions above, the assumptions of Theorem 1.1 are satisfied a.s. 
Hence Theorem 1.3 shows that for every 77 > 



and the result follows by taking the expectation in (4.11) (using dominated 
convergence) . □ 

As an illustration, we apply this theorem to two popular models of random 
graphs. 

4.1. The Newman, Strogatz, Watts model. Durrett [7] actually stated his 
conjecture for the following random graph model, introduced by Newman, 
Strogatz and Watts [17, 18]. Let D be a nonnegative integer- valued random 
variable. Take i.i.d. copies D\,D%, ••• of D. Condition on Y^l=\Di being 
even, and let G n be the random multigraph G*(n,(Di)^), defined by the 
configuration model as in Section 2. (This is a multigraph. We may either 
accept this, or delete all loops and merge parallel edges, or condition on the 
graph being simple; this does not affect our result.) 



Corollary 4.2. Let 7 > 3. Assume that F(D >k) = (^(A; 1 " 7 ) and v := 
KD(D — l)/ED < 1. Then 



(MGn)) 2 < -J2 d ^ G n? = KGnHGn) + fi(G n ) < 2/i(G n ), 




a.s. 



Ci(G n )\ 



A(cy 



+ o p (n 



1/(7-1) 



) 



1-1/ 
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Remark 4.3. If further, as assumed by Durrett [7], P(D = k) ~ ak 7 as 
k — ► oo, with a > 0, then P(L> > &) ~ aiA; 1-7 with ai = a/(j — 1), and it is 

easily seen that n -1 ^ 7-1 ) A(G„) a} /(7_1) Z, with P(Z < z) = exp^x 1 " 7 ) 
for x > 0, one of the classical extreme value distributions. (See, e.g., Lead- 
better, Lindgren and Rootzen [14], Chapter 1 and Example 1.7.6.) Conse- 
quently, 

1/(7-1) 

„-i/(7-i)| Cl(G . B )|_!U£l Z . 

1 — V 

Proof of Corollary 4.2. Let fi = ED. The case \i = is trivial, 
so we may assume fi > 0. The assumption P(Z) > k) = 0(/c 1-7 ) with 7 > 
3 implies that ED 2 <oo, so < /i < 00 and < v < 00. The (weak) law 

of large numbers shows that Ya=i Di/n — ^ fj, and X)it=i D%(P% ~ l)/ n 
EZ)(D — 1) = /xz/, and these hold also if we condition on Ya=i Di even, which 
has probability converging to 1/2 [or 1, if P(-Dj odd) =0]. Hence, using the 
notation (4.1) and (4.2), fx{G n ) \i and v{G n ) — z/, which yields (4.5) 
and (4.6) for any 5 < min(/i, 1 — v). This also implies 

A(G n ) A(G n ) _ u(G n )-u A(n ,_ ,1/(7-1), 



l_„(G n ) (1-i/)(1-i/(G„)) 

so we may replace v{G n ) by 1/ in (4.7). 

It remains only to verify (4.4). Again, it suffices to show the corresponding 
result for the i.i.d. sequence (-Dj)i before conditioning. Let N n {x) := \{i < 
n:Di >x}| and let £(B,x) be the "bad" event {N n (x) > Bnx 1 ^ 1 }, for real 
x,B > 0. Then N n (x) ~ Bi(n,P(L> > a;)), and by assumption F(D > x) < 
Cigx 1 " 7 . Assume that B > 2C W , so EN n (x) = n¥(D > x) < \Bnx x ^ and 
VariV n (x) < EiV n (x) < ^Unx 1-7 . Then, by Chebyshev's inequality, 

(4 , 2) P(£(B ,,)) < ,„ y-w) < ,,^"M> < 2 



CBnx 1 " 7 -EiV n (x)) 2 ~ ((l/2) J Bnz 1 - 7 ) 2 ~ Bnx l ~^' 
Also, since N n {x) is integer- valued, 

(4.13) P(£(£, x)) < F(N n (x) > 1) < E(iV n (x)) < dgnx 1 " 7 . 

We consider a geometric sequence and let Xj := (2 3 'n) 1 ' ( 7_1 ), j £ Z. By (4.12) 
and (4.13), 

(4.14) <max^^-,C 19 2-^. 

Hence, each P(£ (B,Xj)) — > as -B — ► 00; moreover, (4.14) further shows that 
P(£(J3,Xj)) < C2o2~l J *l Consequently, dominated convergence shows that 



J2 W(£(B,Xj))->0 as B 



00. 
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In other words, by choosing B large enough, with probability at least 1 — e 
none of S(B,Xj) holds, that is, N n (xj) < Bnx^~^ for all j € Z, which implies 
N n (x) < IBnx 1 " 1 for all x > 0. [Use N n (x) < N n (xj) for the largest Xj < x.) 
This verifies (4.4), and the result follows by Theorem 4.1. □ 

A minor variation of this model has been used by van der Hofstad, 
Hooghiemstra and Van Mieghem [22]; the same result holds for their version. 

4.2. An inhomogeneous Erdos-Renyi graph. A model that has been used, 
with minor variations, by several authors is the following: Let W±, . . . , W n be 
nonnegative real numbers, random or deterministic. Given these numbers, 
let 

(4.15) Pli := W ' W ' , 

and let G n be the random graph on n vertices where [conditioned on (Wj)™] 
edges appear independently, with P(an edge ij) = ptj for 1 < % < j < n. It 
was observed by Britton, Deijfen and Martin-L6f [3] that with the definition 
(4.15), this yields a random graph that conditioned on its degree sequence 
is uniformly distributed, as we require here. 

A version of this model, which may look simpler, is to replace (4.15) by 
Pij :=WiWj/n [or rather, when necessary, min(WjWj/n, 1)]. However, this 
ruins the uniformity after conditioning on the degree sequence. Nevertheless, 
the difference between the two versions is small, unless some Wi are very 
large, and usually the difference is negligible. More precisely, it can be shown 
[10] that if Ya=i Wf = o p (n 3//2 ), which for example is the case if, as below, Wi 
are i.i.d. with EWf < oo, then the two versions are asymptotically equivalent 
in a strong sense (the total variation distance tends to 0) , and thus the result 
below holds for this version also. 

Another version, introduced by Chung and Lu [4, 5], is to take p^ := 
WiWj/J2k=i Wfc- This becomes the same as the preceding version if we 
replace Wi by WiQ^ k Wk/n)~ l l 2 . Again, results are easily transferred to 
this version; we omit the details. Another minor (and for us inessential) 
variation is used by Norros and Reittu [19] , and several versions are studied 
by van den Esker, van der Hofstad and Hooghiemstra [21] . 

It is common to take (Wi)" i.i.d. (e.g., this is used in [3, 19, 21]), but it 
is also possible to use suitable deterministic sequences [4, 5] or dependent 
random variables; see Bollobas, Janson and Riordan [2], Section 16.4, for a 
general discussion and further references. (The model is a special case of the 
inhomogeneous random graph defined in [2].) For simplicity we treat here 
only the i.i.d. case. The condition for existence of a giant component then 
simply is EWj 2 > 1; see Bollobas, Janson and Riordan [2], Section 16.4. 
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Corollary 4.4. Let G n be the random graph defined as above, using 
(4-15) where Wi are i.i.d. copies of a given nonnegative random variable 
W. Let 7 > 3. Assume that ¥(W >x) = OOr 1 " 7 ), x > 0, and v := EW 2 < 1. 
Then 



A(G? n ) | ^ ^1/(7-1), 
1 - v 



\C 1 (G n )\ = ^l + o p (n 



Proof. We may assume EW > 0. Since n/j,(G n ) = J2idi(G n ) is twice 
the number of edges in G n , it is easily seen that n{G n ) — — > /U := (EW) 2 > 
which verifies (4.5); for example, this follows from the more general Lemma 
8.1 in [2]. 

Similarly, for example by observing that J2idi{G n )(di(G n ) — 1) is twice 
the number of paths of length 2 in G n and using [2], Theorem 17.1, 

1 n 

^(G„H(G?„) = -J^MGnXMGn) - 1) {EWfEW 2 . 

Hence, v(G n ) — v := EW 2 , which verifies (4.6). This also enables us to 
replace v(G n ) by v in (4.7). 

It remains to verify (4.4). First, by the argument in the proof of Corol- 
lary 4.2, for any e > 0, 

(4.16) \{i <n:Wi> k}\ < Bnk 1 ' 1 for all k > 1, 

with probability > 1 — e, provided B = B{e) is large enough. Hence, condi- 
tioning on (4.16), we may assume that (4.16) holds. We condition on (Wi)" 
and assume thus in the remainder of the proof that (for each n) Wj are 
fixed numbers such that (4.16) holds, for some constant B. In particular, 
this implies 

n oo oo 

(4.17) J2 Wi = J2\^- n:Wi - k }\ - J2 Bnk 1 "^ < C 2 m. 

i=l k=l k=l 

Each di(G n ) is a sum of independent 0-1 variables, and by (4.15) and 
(4.17) 

WiWi WiY^iWj 

Edi{G n ) = ' ' < 3 < C 2 iWi. 

^ n + WiWj n 



As a consequence, we have the Chernoff estimate (see [12], Corollary 2.4 and 
Theorem 2.8) 

(4.18) P(<*i(Gn)>t)<exp(-t), t>7C 2l Wi. 

Let j > 1 and let 

Ij :={i<n:Wi< (7C 21 )~ 1 ^}, /j := {1, . . . ,n} \ l r 
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By (4.16), | J<| < C 2 2^ {1 ~ l)j n. Further, let I* be the random set {i G Ij : di(G n ) > 
V}. By (4.18), 

E|J*| < exp(-2 J ')|/ j | < exp(-2 J ')ra < C 2 ^~ lj n, 
and thus, for every e > 0, 

P(|/;| > e- 1 C 23 2^-^ j n) < e2- j . 

Hence, with probability > 1 — e, |ij| < e~ 1 C 2 £2^~^ 3 n for all j > 1, and 
then, assuming e < 1, 

|{i <n:^(G n ) > 2 J '}| < \I*\ + |Jj| < £ - 1 C 24 2( 1 -^n, j > 1, 

which implies (4.1) and completes the proof. □ 
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